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AE000154 11519 bp DNA linear BCT 01-DEC-2000 

Escherichia coli K12 MG1655 section 44 of 400 of the complete 
genome. 

AE000154 U00096 
AE000154. 1 

* 

Escherichia col i K12 
Escherichia col i K12 

Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacter i al es ; 
Enterobacter iaceae; Escherichia. 

1 (bases 1 to 11519) 

Blattner,F.R. , Plunkett,G. Ill, Bloch,C.A., Perna,N.T., Burland,V., 
Riley, M., CoMado-Vides, J. , Gl asner, J. D. , Rode,C.K., Mayhew,G.F., 
Gregor,J., Davis, N.W., Kirkpatrick,H. A. , Goeden,M.A., Rose, D. J., 
Mau, B. and Shao, Y. 

The complete genome sequence of Escherichia coli K-12 

Science 277 (5331), 1453-1474 (1997) 

97426617 

"9278503 

2 (bases 1 to 11519) 
Blattner,F.R. 
Direct Submission 

Submitted (16-JAN-1997) Guy Plunkett III, Laboratory of Genetics, 
University of Wisconsin, 445 Henry Mali, Madison, Wl 53706, USA. 
Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax: 
608-263-7459 

3 (bases 1 to 11519) 
Blattner,F. R. 
Direct Submission 

Submitted (02-SEP-1997) Guy Plunkett III, Laboratory of Genetics, 
University of Wisconsin, 445 Henry Malt, Madison, Wl 53706, USA. 
Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax: 
608-263-7459 

4 (bases 1 to 11519) 
Plunkett, G. III. 
Direct Submission 

Submitted (13-0CT-1998) Laboratory of Genetics, University of 
Wisconsin. 445 Henry Mall, Madison, Wl 53706, USA 
This sequence was determined by the E. coli Genome Project at the 
University of Wisconsin-Madison (Frederick R. Blattner, director). 
Supported by NIH grants HG00301 and HG01428 (from the Human Genome 
Project and NCHGR). The entire sequence was independently 
determined from E. coli K12 strain MG1655. Predicted open reading 
frames were determined using GeneMark software, kindly supplied by 
Mark Borodovsky, Georgia Institute of Technology, Atlanta, GA, 
30332 [e-mail: mark@amber.gatech.edu]. Open reading frames that 
have been correlated with genetic loci are being annotated with CG 
Site Nos., unique ID nos. for the genes in the E. coll Genetic 
Stock Center (CGSC) database at Yale University, kindly supplied by 
Mary Berlyn. A public version of the database is accessible 
(http://cgsc.biology.yale.edu). Annotation of the genome is an 
ongoing task whose goal is to make the genome sequence more useful 
by correlating it with other data. Comments to the authors are 
appreciated. Updated information will be available at the E. coli 
Genome Project's World Wide Web site 

(http://www.genetics.wisc.edu). The E. coli K12 sequence and 
its annotations are periodically updated; this is version M54. No 
sequence changes. Annotation updates: updated gene identifications 
and products; all new functional assignments courtesy of Monica 
Riley; added promoters, protein binding sites, and repeated 
sequences described In reference 1. The unique numeric identifiers 
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beginning with a lowercase ' b' assigned to each gene (protein- or 
RNA-encod i ng) are now designated as gene synonyms instead of 
labels. This should allow them to be searched for in Entrez as gene 
names. 

FEATURES Locat ion/Qual if iers 

source 1, . 1 1519 

/organi sm=" Escher i chi a coli K12" 

/strain=''K12|| 

/sub_stra in=" M61655" 

/db_xref="taxon:83333" 
[Komojex <1..26 

/note=" factor Sigma70: predicted +1 start at 499276" 
promoter compi ement (2. . 30) 

/note=" factor Sigma70; predicted +1 start at 499238" 
gene 106. .1410 

/gene=" gsk" 

/note=" synonym: b0477" 
CDS 106.. 1410 

/gene='' gsk" 
/EC^number="2.7. 1.73" 

/funct ion=" enzyme; Salvage of nucleosides and nucleotides" 
/note="o434; 100 pet identical to GB: EC0GSK_1 
ACCESSI0N:D0079&; 98 pet identical (1 gap) to tNGK.ECOLI 
SIV: P22937" 
/codon_start=l 
/trans l__tabl e=l 1 

/product=" inosine-guanos ine kinase" 
/protein_id=" AAC73579. 1" 
/db_xref="GI : 1786684" 

/t rans I a t i on=" MKFPGKRKSKHYFPVNARDPLLQQFQPENETSAAWVVG I DQTLV 
D I EAKVDDEF I ERYGLSAGHSLV I EDDVAEALYQELKQKNL I THQFAGGT 1 GNTMHNY 
S VLADDRSVLLGVMCSN I E I GSYAYRYLCNTSSRTDLNYLQGVDGP I GRCFTL I GESG 
ERTFA I SPGHMNQLRAES I PEDV I AGASALVLTSYLVRCKPGEPMPEATMKA I EYAKK 
YNVPVVLTLGTKFV I AENPQWWQQFLKDHVS I LAMNEDEAEALTGESDPLLASDKALD 
WVDLVLCTAGP I GLYMAGFTEDEAKRKTQHPLLPGA I AEFNQYEFSRAMRHKDCQNPL 
RVYSH I APYMGGPEK I MNTNGAGDGALAALLHD 1 TANSYHRSNVPNS SKHKFTWLTYS 
SLAQVCKYANRVSYQVLNQHSPRLTRGLPEREDSLEESYWDR" 

repeat_region 1425. .1508 

/note="REP (repetitive extragenic palindromic) element; 
contains 2 REP sequences" 

gene complement (1 543. . 3219) 

/gene="ybaL" 
/note=" synonym: b0478" 

CDS complement (1543. . 3219) 

/gene='' ybaL" 

/funct ion=" putat i ve transport; Not classified" 
/note="f558; 100 pet identical to fragment YBAL_ECOLI 
SW:P39830 (70 aa) but has 6 additional C-terminal aa and 
485 additional N-terminal residues" 
/eodon_start=1 
/trans l_tabl e=l 1 

/product=" putat ive transport protein" 
/protein_(d="AAC73580. 1" 
/db„xref="GI :1786685" 

/t rans I at i on=" MHHATPL I TT I VGGLVLAF I LGMLANKLR I SPLVGYLLAGVLAG 
PFTPGFVADTKLAPELAELGV I LLMFGV6LHFSLKDLMAVKA I A I PGA I AQ I AVATLL 
GMALSAVLGWSLMTG I VFGLCLSTASTVVLLRALEERQL I DSQRGQ I A i GWL I VEDLV 
MVLTLVLLPAVAGMMEQGDVGFATLAVDMG ! T I GKV I AF I A I MMLVGRRLVPW I MARS 
AATGSRELFTLSVLALALGVAFGAVELFDVSFALGAFFAGMVLNESELSHRAAHDTLP 
LRDAFAVLFFVSVGMLFDPL I L I QQPLAVLATLA I I LFGKSLAAFFLVRLFGHSQRTA 
LT I AASLAQ I GEFAF I LAGLGHALNLLPQAGQNLVLAGA I LS I MLNPVLFALLEKYLA 
KTETLEEQTLEEA I EEEKQ I PVD I CNHALLVGYGR VGSLLGEKLLA5D I PL VV I ETSR 
TRVDELRERGVRAVLGNAANEE I MQLAHLECAKWL I LT I PNGYEAGE I VASARAKNPD 
I E I I ARAHYDDEVAY 1 TERGANQVVMGERE I ARTMLELLETPPAGEVVTG" 

P.L9.K0te_L complement (3328. . 3356) 

/note=" factor Sigma70; predicted +1 start at 502564" 

gene eomplement(3457. .4677) 

/gene=" fsr" 

/note=" synonym: b0479" 
CDS complement(3457. . 4677) 

/gene= fsr 
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/funct ion='' putat i ve transport; Not classified" 
/note="f406; 100 pet identical to FSR_ECOLI SW: P52067" 
/codon_start=l 
/transLtable=n 

/product=" fosmidomycin resistance protein" 
/protein_(d="AAC73581. 1" 
/db_xref=" 01:1786686" 

/t r ans I a t i on=" MAMSEQPQPVAGAAASTTKARTSFG I LGA I SLSHLLNDM 1 QSL I 
LA I YPLLQSEFSLTFMQ I CM I TLTFQLASSLLQPVVGYWTDKYPMPWSLP I GMCFTLS 
GLVLLALAGSFGAVLLAAALVGTGSSVFHPESSRVARMASGGRHGLAQSIFQVGGNFG 
SSLGPLLAAVI lAPYGKGNVAHfFVLAALLAl VVLAQISRWYSAQHRMNK6KPKATI IN 
PLPRNK VVLAVS I LL I L I FSKYFYMAS 1 SSYYTFYLMQKFGLS I QNAQLHLFAFLFAV 
AAGTV I GGPVGDK I GRKYV I WGS I LGVAPFTL I LPYASLHWTGVLTV 1 1 GF I LASAFS 
A I LVYAQELLPGR I GHVSGLFFGFAFGMGGLGAAVLGL I ADHTS I ELVYK I CAFLPLL 
GMLTIFLPDNRHKD" 

promotef complement (471 1. . 4738) 

/note=" factor Sigina70; predicted +1 start at 503947" 

promoter comp I ement (4742. . 4770) 

/note=" factor SiEnia70; predicted +1 start at 503978" 

promoter complement (4770. . 4797) 

/note=" factor Sigma70; predicted +1 start at 504006" 

promoter 4833.. 4861 

/note=" factor Sigma70; predicted +1 start at 504111" 

promoter complement (4839. . 4865) 

/note=" factor Sigma70; predicted +1 start at 504075" 

gene 4895., 6547 

/gene=" ushA" 
/no te=" synonym: b0480" 

CDS 4895.. 6547 

/gene=VushA" 
/EC_number="3.6. 1.45" 

/funct ion=" enzyme; Central intermediary metabol i sm: 
Sugar-nucleot ide biosynthesis, conversions" 
/note="o550; 99 pet identical to USHAJCOLI SW: P07024" 
/codon_star t=l 
/transl_table=ll 

/product="UDP-sugar hydrolase (5* -nuc I eot idase)" 
/protein_id="AAC73582. 1" 
/db.xref^^GI: 1786687" 

/ 1 rans I a t i on=" MKLLQRGVALALLTTFTLASETALAYEQDKTYK I TVLHTNDHHG 
HFWRNEYGEYGLAAQKTLVDG 1 RKEVAAEGGSVLLLSGGD I NTGVPESDLQDAEPDFR 
GMNLVGYDAMA I GNHEFDNPLTVLRQQEKWAKFPLLSAN I YQKSTGERLFKPWALFKR 
QDLK I AV I GLTTDDTAK 1 GNPEYFTD I EFRKPADEAKLV I QELQQTEKPD 1 1 1 AATHM 
GHYDNGEHGSNAPGDVEMARALPAGSLAM I VGGHSQDPVCMAAENKKQVDYVPGTPCK 
PDQQNG I W I VQAHEWGKYVGRADFEFRNGEMKMVNYQL I PVNLKKKVTWEDGKSERVL 
YTPE I AENQQM I SLLSPFQNKGKAQLEVK I GETNGRLEGDRDKVRFVQTNMGRL I LAA 
QMDRTGADFAVMSGGG i RDS I EAGD I S YKNVLK VQPFGN VVVYADMTGKEV I DYLTAV 
AQMKPDSGAYPQFANVSFVAKDGKLNDLK I KGEPVDPAKTYRMATLNFNATGGDGYPR 
LDNKPGYVNTGFIDAEVLKAYIQKSSPLDVSVYEPKGEVSWQ" 

gene coropi ement (6584. , 7063) 

/gene=" ybaK" 
/no te=" synonym: bOASV 

CDS complement (6584. , 7063) 

/gene="ybaK" 

/funct ion=" orf ; Unknown" 

/note="fl59; 90 pet identical amino acid sequence and 
equal length to YBAK_SALTY SW: P37174" 
/codon_star t=l 
/transl_table=11 

/product="orf , hypothetical protein" 
/protein„id="AAC73583. 1" 
/db_xref="GI: 1786688" 

/t r ans I a t i 0 n=" MTPAVKLLEKNK I SFQ I HTYEHDPAETNFGDEVVKKLGLNPDQV 
YKTLLVAVNGDMKHLAVAVTPVAGQLDLKKVAKALGAKKVEMADPMVAQRSTGYLVGG 
I SPLGQKKRLPT 1 1 DAPAQEFAT I YVSGGKRGLD I ELAAGDLAK 1 LDAKFAD I ARRD" 

promoter compi ement (721 1 .. 7239) 

/note=" factor Sigma70; predicted +1 start at 506447" 

gene complement (7267. . 8061) 

/gene-"ybaP" 
/not€=" synonym: b0482" 

CDS complement (7267. . 8061) 
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/gene="ybaP" 

/function=''putative enzyme; Not classified" 

/note="f264; This 264 aa ORF is 25 pet identical (10 gaps) 

to 174 residues of an approx. 976 aa protein SLA2_YEAST 

SW: P33338" 

/codon_start=l 

/transl_table=ll 

/product=''putat i ve ligase" 

/protein_id="AAC73584. 1" 

/db„xref=''GI : 1786689" 

/ 1 r ans I a t i on='' MDLLYRVKTLWAALRGNHYTWPA I D I TLPGNRHFHL I OS I HMGS 
HDMAPLPTRLLKKLKNADAL I VEADVSTSDTPFANLPACEALEER I SEEQLQNLQH I S 
QEMG I SPSLFSTQPLWQ t AMVLQATQAQKLGLRAEYG I DYQLLQAAKQQHKPV I ELEG 
AENQ I AMLLQLPDKGLALLDDTLTHWHTNARLLQQMMSWWLNAPPQNND I TLPNTFSQ 
SLYDVLMHQRNLAWRDKLRAMPPGRYVVAVGALHLYGEGNLPQMLR" 

promoter 7980. . 8007 

/note=" factor Sigma70; predicted +1 start at 507257" 

promoter 8056. . 8085 

/note=" factor Sigma70; predicted +1 start at 507335" 

gene 8145. . 8540 

/gene=''ybaQ'' 
/note=" synonym: b0483" 

CDS 8145. .8540 

/gene=''ybaQ" 

/f unct ion=" orf ; Unknown" 

/note="ol31; This 131 aa ORF is 38 pet identical (0 gaps) 
to 59 residues of an approx. 80 aa protein YSY1_SYNP7 SW: 
P3737r 

/codon_star t=l 
/trans l_tabl e=l 1 

/product="orf , hypothetical protein" 
/protein_id="AAC73585. 1" 
/db_xref="GI : 1786690" 

/ 1 rans I a t i on=" M I QYVLASLFTRKQQLKTMKQATRKPTTPGD I LLYEYLEPLDLK 

I NELAELLHVHRNSVSAL I NNNRKLTTEMAFRLAKVFDTTVDFWLNLQAAVDLWEVEN 

NMRTQEELGR I ETVAEYLARREERAKKVA" 
promoter complement (8153. .8179) 

/note='' factor Sigma70; predicted +1 start at 507389" 
repeat_region 8563. .8798 

/note="REP (repetitive extragenic palindromic) element; 

contains 5 REP sequences" 
gene complement (8856. . 11360) 

/gene=" ybaR" 

/note=" synonym: b0484" 
CDS compi ement (8856. . 1 1360) 

/gene=" ybaR" 

/f unct ion=" putat i ve enzyme; Not classified" 

/note="f834: This 834 aa ORF is 44 pet identical (16 gaps) 

to 735 residues of an approx. 752 aa protein ATCS_SYNP7 

SW: P37279" 

/codon_start=1 

/trans l_tabl e=l 1 

/product="putat I ve ATPase" 

/protein_id="AAC73586. 1" 

/db_xref="GI :178669r 

/translation="MSQTIDLTLDGLSCGHCVKRVKESLEQRPDVEQADVSITEAHVT 
GTASAEQL I ET I KQAGYDASVSHPKAKPLAESS I PSEALTAVSEALPAATADDDDSQQ 
LLLSGMSCASCVTRVQNALQSVPGVTQARVNLAERTALVMGSASPQDLVQAVEKAGYG 
AEA I EDDAKRRERQQETAVATMKRFRWQA I VALAVG I PVMVWGM I GDNMMVTADNRSL 
WLV I GL I TLAVMVFAGGHFYRSAWKSLLNGAATMDTLVALGTGVAWLYSMS VNLWPQW 
FPMEARHLYYEASAM 1 1 GL I NLGHMLEARARQRSSKALEKLLDLTPPTARLVTDEGEK 
SVPLAEVQPGMLLRLTTGDRVPVDGE I TQGEAWLDEAMLTGEP I PQQKGEGDSVHAGT 
VVQDGSVLFRASAVGSHTTLSR I I RMVRQAQSSKPE I GQLADK I SAVFVPVVVV I ALV 
SAAIWYFFGPAPQIVYTLVIATTVLI I ACPCALGLATPMS I I SGVGRAAEFGVLVRDA 
DALQRASTLDTVVFDKTGTLTEGKPQVVAVKTFADVDEAQALRLAAALEQGSSHPLAR 
A I LDKAGDMQLPQVNGFRTLRGLGVSGEAEGHALLLGNQALLNEQQVGTKA I EAE I TA 
QASQGATPVLLAVDGKAVALLAVRDPLRSDSVAALQRLHKAGYRLVMLTGDNPTTANA 
I AKEAG I DE V I AGVLPDGKAEA I KHLQSEGRQVAMVGDG I NDAPALAQADVG I AMGGG 
SDVA I ETAA I TLMRHSLMGVADALA I SRATLHNMKQNLLGAF I YNS ! G I PVAAG I LWP 
FTGTLLNPVVAGAAMALSS I TVVSNANRLLRFKPKE" 
promoter compI ement (1 1401 1 1431) 
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/note="factor SigmaTO; predicted +1 start at 510637" 
promoter complement (11438. . 1 1466) 

/note="factor Sigma70; predicted +1 start at 510674" 
promoter 11466. .11495 

/note="factor Sigma70: predicted +1 start at 510745" 
promoter 11504. .>11519 

/note=''factor Sigma70; predicted +1 start at 510782" 
BASE COUNT 2883 a 3112 c 2969 g 2555 t 
ORIGIN 

1 gaaaatgcga tcccgcctgc tgatattgaa actggctgcg tctcgcgcgc tcccgtcaga 
61 ttgtgttaac attcgccgct cagttaacca cccgtaaaaa caaccatgaa atttcccggt 
121 aaacgtaaat ccaaacatta cttccccgta aacgcacgcg atccgctgct tcagcaattc 
181 cagccagaaa acgaaaccag cgctgcctgg gtagtgggta tcgatcaaac gctggtcgat 
241 attgaagcga aagtggatga tgaatttatt gagcgttatg gattaagcgc cgggcattca 
301 ctggtgattg aggatgatgt agccgaagcg ctttatcagg aactaaaaca gaaaaacctg 
361 attacccatc agtttgcggg tggcaccatt ggtaacacca tgcacaacta ctcggtgctc 
421 gcggacgacc gttcggtgct gctgggcgtc atgtgcagca atattgaaat tggcagttat 
481 gcctatcgtt acctgtgtaa cacttccagc cgtaccgatc ttaactatct acaaggcgtg 
541 gatggcccga ttggtcgttg ctttacgctg attggcgagt ccggggaacg tacctttgct 
601 atcagtccag gccacatgaa ccagctgcgg gctgaaagca ttccggaaga tgtgattgcc 

661 ggagcctcgg cactggttct cacctcatat ctggtgcgtt gcaagccggg tgaacccatg 
721 ccggaagcaa ccatgaaagc cattgagtac gcgaagaaat ataacgtacc ggtggtgctg 
781 acgctgggca ccaagtttgt cattgccgag aatccgcagt ggtggcagca attcctcaaa 

841 gatcacgtct ctatccttgc gatgaacgaa gatgaagccg aagcgttgac cggagaaagc 

901 gatccgttgt tggcatctga caaggcgctg gactgggtag atctggtgct gtgcaccgcc 

961 gggccaatcg gcttgtatat ggcgggcttt accgaagacg aagcgaaacg taaaacccag 

1021 catccgctgc tgccgggcgc tatagcggaa ttcaaccagt atgagtttag ccgcgccatg 

1081 cgccacaagg attgccagaa tccgctgcgt gtatattcgc acattgcgcc gtacatgggc 

1141 gggccggaaa aaatcatgaa cactaatgga gcgggggatg gcgcattggc agcgttgctg 

1201 catgacatta ccgccaacag ctaccatcgt agcaacgtac caaactccag caaacataaa 
1261 ttcacctggt taacttattc atcgttagcg caggtgtgta aatatgctaa ccgtgtgagc 
1321 tatcaggtac tgaaccagca ttcacctcgt ttaacgcgcg gcttgccgga gcgtgaagac 

1381 agcctggaag agtcttactg ggatcgttaa gttatcgtcg gttcgtaggc cagataaggc 

1441 gttcacgccg catctggcat ttggctctcg atgcctgatg cgacgctggc gcgtcttatc 

1501 atgcctacat atttttcata ttttacatcc ggcaaccacc gtttaccccg tcaccacctc 

1561 acccgccggt ggcgtttcca gcagttccag catggtacgg gcgatttcac gctcgcccat 

1621 cactacctga ttcgcaccac gttcggtgat atacgccact tcatcgtcat aatgggcgcg 

1681 ggcaataatc tcaatatccg gatttttcgc gcgggcagat gccacaatct cacccgcttc 

1741 ataaccgttg ggaatcgtca ggatcagcca ttttgcacat tccagatgcg ccagttgcat 

1801 aatttcttcg ttcgccgcat tgcccaatac tgcgcggacc ccgcgctctc gcagctcatc 

1861 aacacgggtt cgtgacgtct caatcaccac cagcggaata tcagaggcga gcaatttctc 

1921 ccccagcagg ctgcctacac gaccgtaacc caccagtagc gcatggttgc aaatatccac 

1981 tgggatctgc ttctcttclt cgattgcctc ttccagcgtc tgctcttcca gcgtttcggt 

2041 cttcgccaga tatttctcca gtagtgcgaa cagtaccggg ttgagcataa tcgacaggat 

2101 cgcccctgcc agtaccaggt tttgtccggc ctgcggcagt aaattcaatg ccattcccag 

2161 tcccgccagg ataaacgcga actcaccaat ctgcgccagg ctggcggcga tggttaatgc 

2221 cgtacgttgg gagtgaccaa acagtcgcac caggaaaaat gcggctaacg acttaccaaa 

2281 cagaataatc gccagcgtcg ccagcactgc cagcggttgc tgaatcagaa ttaacggatc 

2341 aaacaacatc ccgacggaga caaaaaacag caccgcaaac gcgtcgcgca atggcagcgt 

2401 atcgtgggcg gcacggtgac tcagttcaga ctcgttcagt accatcccgg caaagaacgc 

2461 accgagtgca aaggagacat caaacagctc taccgcacca aaggcaaccc ctaacgccag 

2521 cgccagcacc gacagggtaa acagctcgcg agaaccggtt gccgcgctgc gtgccataat 

2581 ccacggcacc agacggcgac ctaccagcat cataatggcg ataaatgcga tcactttgcc 

2641 gatggtgatc cccatatcga ctgcaagagt ggcaaagccc acatcgccct gttccatcat 

2701 tcctgccact gcgggcagca acaccagcgt cagaaccatt accaggtctt ccacaatcaa 

2761 ccaaccgatg gcgatttgcc cacgctgact gtcaattaat tgccgttctt caagtgcgcg 

2821 cagtaacacc acggtactgg cggtggaaag acataaaccg aacacgatac cggtcattaa 

2881 cgaccagccc agcacggcag agagcgccat acccagcagc gtcgccacgg ctatctgggc 

2941 gatcgcaccg ggaatggcga tggcctttac cgccatcaaa tccttcagcg aaaagtgcaa 

3001 accgacgcca aacatcaaca gaatgacgcc cagttcagcc agttccgggg caagcttggt 

3061 atcggcaaca aagcccggag tgaatggtcc tgccagcaca cccgctaaca gatatcccac 

3121 cagaggagaa atacgtagtt tattggccag catgccgagg ataaaggcga gcacaaggcc 

3181 gccaacaatg gtggtgataa gcggggtggc gtgatgcatt ccgtctcctt ttcctggtgg 

3241 ttattgtcca tttttggccg ggaaaaccaa aattacaggt aatagtttat gacaatttca 

3301 ttgatgatgt tcatgaataa ttattgaatt ttgcagaaaa atggaattag ctgcaaaaaa 

3361 agcacggatc ggaaaacaga aggcggttta acagaggaaa ggcttatggc gactgtaaaa 

3421 cgatgcagcc aaagtttgcc tttggctgca atgaaatcag tctttatgcc ggttatcagg 

3481 caggaatatg gtcaacatcc ccaatagtgg caggaaagca cagattttat agactaactc 

3541 gatgctggtg tgatcggcga taagccccag aactgccgct cccagacctc ccatgccaaa 

3601 agcaaaaccg aaaaagagtc cagaaaccat accgatacgt cctggaagca gctcctgagc 

3661 gtagaccaga atggcagaga atgccgaagc gaggataaat ccaataatca ccgttaaaac 
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3721 ccccgtccag tgcaggctgg cgtagggtaa 

3781 agagccccaa atcacatatt tccgcccaat 

3841 acctgccgca acggcaaaca ggaaggcaaa 

3901 tccgaatttt tgcatcagat aaaaggtgta 

3961 tttcgagaaa atgaggatta acagaatgct 

4021 tggattgata atcgtcgctt tgggttttcc 

4081 acggctgatt tgcgccaaca ccacgatcgc 

4141 gttgcctttg ccataaggcg cgataatcac 

4201 aaagttgccg ccgacctgaa agatagattg 

4261 acgggccacg cgagaagatt ccggatgaaa 

4321 cgccagcaga actgcgccaa aactgcccgc 

4381 aaagcacatg ccaattggca acgaccatgg 

4441 cactggttgc agtagcgaag aggcgagctg 

4501 aaatgtcaga gaaaattctg actgaagcag 

4561 catgtcgttc agcagatgtg agaggctgat 

4621 cttggtcgtt gacgcagccg cgcccgccac 

4681 aaagtcactt tttcagggtt gcgatgtaaa 

4741 taacatacct gtatgcgtcg tctgaaggaa 

4801 tggatgcaga tttatcttca ccggacgcag 

4861 ttcaacacgt tgaaatcagg tcagggagag 

4921 gttagcgctg ttaaccacat ttacactggc 

4981 taaaacctac aaaattacag ttctgcatac 

5041 tgaatatggc gaatatggtc tggcggcgca 

5101 ggttgcggct gaaggcggta gcgtgctgct 

5161 gcccgagtct gacttacagg atgccgaacc 

5221 tgacgcgatg gcgatcggta atcatgaatt 

5281 ggaaaagtgg gccaagttcc cgttgctttc 

5341 gcgcctgttt aaaccgtggg cgctgtttaa 

5401 gctgacaacc gatgacacag caaaaattgg 

5461 tcgtaagccc gccgatgaag cgaagctggt 

5521 agacattatt atcgcggcga cccatatggg 

5581 cgcaccgggc gatgtggaga tggcacgcgc 

5641 cggtggtcac tcgcaagatc cggtctgcat 

5701 cgtgccgggt acgccatgca aaccagatca 

5761 tgagtggggc aaatacgtgg gacgggctga 

5821 ggttaactac cagctgattc cggtgaacct 

5881 aagcgagcgc gtgctttaca ctcctgaaat 

5941 atcaccgttc cagaacaaag gcaaagcgca 

6001 tcgtctggaa ggcgatcgtg acaaagtgcg 

6061 tctggcagcc caaatggatc gcactggtgc 

6121 tcgtgattct atcgaagcag gcgatatcag 

6181 cggcaatgtg gtggtgtatg ccgacatgac 

6241 cgtcgcgcag atgaagccag attcaggtgc 

6301 ggcgaaagac ggcaaactga acgaccttaa 

6361 aacttaccgt atggcgacat taaacttcaa 

6421 tgataacaaa ccgggctatg tgaataccgg 

6481 tatccagaaa agctcgccgc tggatgtgag 

6541 gcagtaatcc gaaagtgccg gatgtttgca 

6601 tatcagcaaa tttggcatcg aggatctttg 

6661 gtccgcgctt gccgccggaa acataaatag 

6721 tcgttggcag acgttttttc tgccccagtg 

6781 aacgctgcgc gaccatcgga tcggccatct 

6841 cttttttaag atccagttga cctgcgaccg 

6901 caccgttcac tgccaccagc agcgttttgt 

6961 tgacgacttc atcgccaaaa ttggtttcag 

7021 acgaaatctt gtttttttcg agtaatttaa 

7081 aaacaaaaaa acgaagcaag catacgcctg 

7141 ctaagagtat tggcaggatg gtgagattga 

7201 tgacggcata ataataaaaa aatgaaattc 

7261 atttttttag cgcaacattt gcggcaaatt 

7321 cgcgaccaca tatcgccccg gcggcatggc 

7381 ctgatgcatc agcacatcgt acagcgactg 

7441 attttgcggc ggtgcattca gccaccagct 

7501 atgccagtgg gtcagcgtat cgtccagcag 

7561 catggcaatc tggttttcag ccccttccag 

7621 cgcctgcaat agctggtaat cgataccgta 

7681 cgtcgcctga agaaccatcg cgatttgcca 

7741 aatgcccatc tcctggctaa tgtgctgcag 

7801 ttccagcgcc tcgcaggcag gcaaattagc 

7861 ctcgacgatc agcgcatcgg cgtttttgag 

7921 catatcgtgg ctacccatat gaatactgcc 



aatcagcgta aacggcgcaa cgccgaggat 

tttatcccct acaggcccgc cgatcaccgt 

cagatgaagc tgagcattct ggatagataa 

atagctgctg atgctcgcca tatagaaata 

gaccgccagt acaactttat tgcgcggcag 

tttattcatt cggtgctgtg ccgagtacca 

cagcagtgcc gcaagcacaa accaggcaac 

cgccgccagc aagggtccca gggaactgcc 

cgccaggcca tgccgcccgc cggaagccat 

gaccgatgaa ccggtaccga ccagcgccgc 

cagcgcaagc agcaccagac cgcttaaggt 

catcggatat ttatcggtcc agtagccgac 

gaaggtgagg gttatcatgc caatctgcat 

cggataaatc gccagaatca gcgattggat 

agcacctaaa ataccaaacg atgttcgggc 

aggctggggt tgttcactca ttgccatagg 

gaatgatctt atttgtgatt attaccagac 

gtctcaacgc cgaatacaga atttctaatc 

acttgtctat gatgtcgcgt catactattt 

aagtatgaaa ttattgcagc ggggcgtggc 

gagtgaaact gctctggcgt atgagcagga 

caatgatcat catgggcatt tttggcgcaa 

aaaaacgctg gtggatggta tccgcaaaga 

actttccggt ggcgacatta acactggcgt 

tgattttcgc ggtatgaatc tggtgggcta 

tgataatccg ctcaccgtat tacgccagca 

cgcgaatatc taccagaaaa gtactggcga 

gcgtcaggat ctgaaaattg ccgttattgg 

taacccggaa tacttcactg atatcgaatt 

gattcaggag ctgcaacaga cagaaaagcc 

gcattacgat aatggtgagc acggctctaa 

gctgcctgcc ggatcgctgg cgatgatcgt 

ggcggcagaa aacaaaaaac aggtcgatta 

acaaaacggc atctggattg tgcaggcgca 

ttttgagttt cgtaatggcg aaatgaaaat 

gaagaagaaa gtgacctggg aagacgggaa 

cgctgaaaac cagcaaatga tctcgctgtt 

gctggaagtg aaaataggcg aaaccaatgg 

ttttgtacag accaatatgg ggcggttgat 

cgactttgcg gtgatgagcg gaggcggaat 

ctataaaaac gtgctgaaag tgcagccatt 

cggtaaagag gtgattgatt acctgaccgc 

ctacccgcaa tttgccaacg ttagctttgt 

aatcaaaggc gaaccggtcg atccggcgaa 

tgccaccggc ggtgatggat atccgcgcct 

ctttattgat gccgaagtgc tgaaagcgta 

tgtttatgaa ccgaaaggtg aggtgagctg 

tccggcacaa tgcttaatcg cggcgggcga 

ccagatcgcc tgccgccagt tcgatatcca 

tggcaaattc ttgtgcgggg gcgtcgataa 

ggctaatccc cccaaccagg tatcccgtcg 

caactttctt ggcacccagc gcttttgcta 

gcgtaacggc cacggcaagg tgtttcatat 

agacctgatc cggattcaaa cctaattttt 

ccggatcgtg ctcgtaggta tggatttgaa 

ctgcgggtgt catggtaatt ttcctgcctt 

atggcggtcc aaaaaaaaga gtcatcttgc 

gcgacaatcg agttacaccg tcgcttaaag 

ctctttgacg ggccaatagc gatattggcc 

cccttctcca tacaggtgta gtgcaccgac 

gcgtaattta tcccgccagg cgagatttcg 

actgaacgta ttgggcagcg ttatatcatt 

catcatttgt tgcagcaacc gtgcgttggt 

cgccagtcct ttgtcaggga gctggagcaa 

ttcaatcacg ggtttatgtt gttgcttcgc 

ttctgcccgc agccccagtt tttgtgcctg 

cagcggttgg gtagaaaaga gtgagggaga 

gttttggagt tgctcctcgc taatgcgctc 

aaaaggcgta tcgctggtgg aaacatccgc 

ctttttgagc aaacgggtgg gcaggggagc 

aatcagatga aaatggcgat tgccggggag 



http://srs.ddbj.nig.ac jp/cgi-bin/wget2?-id+46Vfb1 LSmva+-e+[DDB JRELEASE:'AE(... 2003/06/1 9 



Enjtry Page 



7/7 



// 



7981 ggtgatatcg 

8041 tacccggtac 

8101 agggatggcg 

8161 ttgcatcgct 

8221 cgacgacacc 

8281 atgagttagc 

8341 atcgtaaact 

8401 atttttggct 

8461 cccaggaaga 

8521 gtgcaaaaaa 

8581 tcacgccgca 

8641 gcctacaaac 

8701 cgagcaccat 

8761 aaccgtaggc 

8821 cgccacccta 

8881 aaccggttgg 

8941 accgggttaa 

9001 atactgttgt 

9061 gaaatagcga 

9121 gtttcaatgg 

9181 gctggcgcgt 

9241 tgtttgatcg 

9301 gcttctttgg 

9361 tatcccgctt 

9421 actgccagca 

9481 gatgcctgag 

9541 agcgcctgat 

9601 cgcaatgtgc 

9661 gctcgtgcca 

9721 gcttcatcaa 

9781 gtcccggttt 

9841 tcccgcacca 

9901 gtcgccagcc 

9961 gtatagacaa 

10021 ataaccacca 

10081 ggcttgctgc 

10141 ccaaccgcac 

10201 ctatcgcctt 

10261 catgcttcgc 

10321 agcaacatac 

10381 aggcgtgccg 

10441 cgtgcgcgcg 

10501 tcgtaataaa 

10561 gaatagagcc 

10621 ttcagcaggc 

10681 agggttatca 

10741 tcgccgatca 

10801 cagcggaagc 

10861 tcatcttcaa 

10921 ggggaggcac 

10981 acgcccggta 

11041 ccgctcagca 

11101 gaaaccgctg 

11161 tggcttacag 

11221 gcagtcccgg 

11281 tgttcaagac 

11341 aggtcgatag 

11401 ggttaaacct 

11461 gtaaagatta 



atggcaggcc 
aacagatcca 
caaatggagt 
attcacaagg 
cggagatat t 
agagttgctg 
cactactgag 
aaacctccag 
attgggacgg 
ggtcgcgtaa 
tccggcatcc 
t tgtgcctga 
tgcctgatgc 
cggataaggc 
aagcagcgca 
cgttactcac 
gcagtgt tec 
agataaacgc 
gcgcatccgc 
caacatcact 
cgt taatgcc 
cttcggcttt 
cgatcgcat t 
ta^tgcaggcg 
gggctaccgc 
cagtaatctc 
tgcccagcaa 
ggaaaccgt t 
gcggatggct 
catcagcaaa 
tatcgaacac 
gcacgccaaa 
ccagcgcaca 
tctgcggtgc 
ctaccggcac 
tctgggcctg 
tggcacgaaa 
cgcctttttg 
cctgggtaat 
ctggc tgcac 
tcggcggggt 
ct tccagcat 
gatgtcgcgc 
acgccacgcc 
1 1 1 tccatgc 
ggccgataac 
tcccccagac 
get tcatcgt 
tcgct tccgc 
tgcccatcac 
cgctttgcag 
gcaactgctg 
tcagtgcttc 
atgcgtcata 
taacgtgcgc 
tttctttcac 
tttgtgacat 
tccagcaagg 
aaaaatcggc 



aggtgtaatg 
tacgacctcc 
tggagact tg 
aagcaacagt 
ct tctctatg 
catgt tcatc 
atggcatttc 
gcggcggttg 
attgaaacag 
ggccaaaaca 
gagcaccat t 
accgtaggcc 
gacgct tacg 
gt t tacgccg 
tccgcaatga 
tacggtaatc 
agtgaacggc 
accgagcagg 
aacgcccatc 
gccgccaccc 
gtcgcccacc 
accgtccggc 
ggcggtggtt 
t tgcagcgcc 
1 1 tcccgtca 
cgct tcgata 
taacgcatga 
gacctgcggt 
ggaaccttgc 
Igttttcact 
tacagtgtcg 
ctcagccgcc 
cggacaggca 
cggaccaaag 
aaatacggct 
gcgcaccatg 
cagcacactg 
ctgcgggatt 
ctcgccatct 
ttctgccaga 
taaatcgagt 
atggccgaga 
ttccatcggg 
agtacccagc 
actgcggtaa 
caaccacagg 
catcaccggg 
agcgacggcg 
gccgtagccc 
cagcgcagta 
cgcattttgt 
gctgtcatcg 
cgacgggatt 
acccgcttgt 
ttcagtgata 
gcgtttaacg 
aaaacactcc 
ggaaggtcaa 
tcgatttgca 



attaccgcgc 
ctttgtgaaa 
tttaatgtgt 
taaaaaccat 
aatatctgga 
gtaatagcgt 
gtctggcgaa 
atctttggga 
tggctgaata 
tcgcttcatc 
gcctgatgcg 
ggataaggcg 
cgtcttatca 
catccggcag 
tgtacttatt 
gacgagagcg 
cacaaaatac 
t tctgct tea 
aggctatggc 
atcgcaatgc 
at tgccacct 
agcaccccgg 
gggttatccc 
gccacgctat 
accgccagca 
gctttggtac 
ccttcagctt 
agctgcatat 
tccagtgccg 
gcgacaacct 
agtgtactgg 
cgcccgacgc 
ataatcagta 
aaataccaga 
gagattttat 
cgaatgattc 
ccgtcctgta 
ggt tcgcccg 
accggcacgc 
ggcacgcttt 
aacttt tcca 
t tgatcagac 
aaccactgcg 
gccaccagcg 
aaatggccgc 
ctgcggt tgt 
atacccaccg 
gtttcttgct 
gctttttcca 
cgctccgcca 
acgcgggtga 
tcatcggcgg 
gatgactccg 
ttgatggttt 
gacacatccg 
cagtgaccgc 
tttaagacag 
gaaat taata 
tcaggat tag 



agcgcggccc 
tatcatgcta 
ttgtatgatt 
gaaacaggca 
accgctcgat 
cagtgcactg 
agtttttgat 
agt tgaaaac 
tttggcacgc 
tcgtaggccg 
acgcttaccc 
1 1 tacgccgc 
ggcctacaaa 
tcatgcgtcg 
ccttcggtt t 
ccattgccgc 
cggcggcgac 
tgttgtgcag 
gcatcagggt 
cgacatccgc 
gacgtcc t tc 
cgatcacctc 
cggtcaacat 
cactacgcaa 
gcacaggcgt 
caacctgt tg 
caccgctcac 
cacctgcttt 
ccgccagacg 
gcggcttccc 
cgcgttgcag 
cggaaataat 
ccgtggtggc 
ttgccgcact 
ccgccagctg 
gtgacagcgt 
ccactgtccc 
tcagcatcgc 
gatcgccggt 
tttcaccttc 
gcgccttaga 
cgataatcat 
gccacaggtt 
tatccatcgt 
cggcgaaaac 
cagcggtgac 
ccagtgcgac 
ggcgctcgcg 
ccgcctgcac 
ggtttacccg 
cacagctggc 
tcgctgccgg 
ccagcggttt 
cgatcagctg 
cctgctcaac 
aggacaggcc 
ttttgactgg 
aaccaggcgg 
acatttatct 



aaagcgtttt 

gcgcgcggtg 

cagtatgt tc 

acaagaaaac 

ttgaaaatca 

atcaataaca 

accacagtcg 

aacatgcgca 

cgtgaagagc 

gataaggcgt 

gtcttatcag 

atccggcatc 

cctgtgcctg 

atgccaaatg 

aaaccgcagc 

tccggcaact 

cggaataccg 

cgt tgcgcgg 

aatcgccgcg 

ctgagccaec 

act ttgcaga 

atcaatccct 

caccagacga 

cggatcgcgt 

tgcccct tgc 

ctcatttaac 

gcccagcccg 

atcgaggatc 

caatgcctgc 

ttcagtcagc 

cgcgtcagcg 

cgacatcggc 

aatcaccagg 

gacaagcgca 

accgatt tct 

agtatggctg 

ggcatggacg 

t tcatccagc 

cgtcaggcgc 

gtcagtaacc 

agaacgctgg 

cgcgctggct 

gacgctcatc 

cgccgcaccg 

catcactgcc 

catcatgt ta 

aat tgcctgc 

gcgtttagcg 

taaatcttgt 

tgcctgagtg 

gcagctcatg 

aagcgcctca 

agcctttggg 

ttctgcactg 

atccggacgc 

gtccagggtc 

ctgtgataaa 

gtaaaagtcc 

ctttgtttt 
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